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ABSTRACT 

The macrocyclic bis-naphthalene macrocycle 
(2,7-BisNP), belonging to the cyclobisintercalator 
family of DNA ligands, recognizes T-T mismatch 
sites in duplex DNA with high affinity and selectivity, 
as evidenced by thermal denaturation experiments 
and NMR titrations. The binding of this macrocycle 
to an 11-mer DNA oligonucleotide containing a T-T 
mismatch was studied using NMR spectroscopy 
and NMR-restrained molecular modeling. The 
ligand forms a single type of complex with the 
DNA, in which one of the naphthalene rings of the lig- 
and occupies the place of one of the mismatched 
thymines, which is flipped out of the duplex. The 
second naphthalene unit of the ligand intercalates 
at the A-T base pair flanking the mismatch site, 
leading to encapsulation of its thymine residue via 
double stacking. The polyammonium linking chains 
of the macrocycle are located in the minor and the 
major grooves of the oligonucleotide and participate 
in the stabilization of the complex by formation of 
hydrogen bonds with the encapsulated thymine 
base and the mismatched thymine remaining 
inside the helix. The study highlights the uniqueness 
of this cyclobisintercalation binding mode and its 
importance for recognition of DNA lesion sites by 
small molecules. 

INTRODUCTION 

Mismatched, or non-complementary, base pairs in 
DNA are permanently formed in living cells due to 



misincorporation of nucleotides during DNA synthesis, 
inclusion of chemically damaged nucleotides, or inclusion 
of normal nucleotides opposite damaged bases in the 
DNA template strand (1). The rate of generation of 
DNA mismatches is greatly increased by exogenous 
factors, such as genotoxic chemicals or UV radiation. 
DNA mismatches disrupt the genetic information and 
are therefore highly toxic for the organism. As a conse- 
quence, all organisms possess enzyme systems responsible 
for the detection and repair of damaged DNA (DNA 
repair systems) (2,3). The primary task of the DNA 
repair system, i.e. detection and excision of damaged 
and mismatched bases from the DNA, is met by the 
DNA glycosylases that locate the damaged or 'wrong' 
nucleobases embedded in the massive excess of undam- 
aged genome much like a needle in a haystack. The mech- 
anism, by which DNA glycosylases accomplish this 
formidable task, extremely efficiently and rapidly, is still 
a matter of debate (1,4-9), but ultimately its deeper under- 
standing will allow us to control the DNA repair processes 
in normal and tumor tissues. 

Notably, modern cancer therapy is largely based on the 
agents (drugs or radiation) killing cancer cells more effi- 
ciently than normal cells, and DNA-damaging drugs rep- 
resent the primary therapy in most cases. However, due to 
the activity of the DNA repair system, a large part of the 
drug-induced lesions is repaired before they cause cell 
death and stop proliferation. Therefore, the efficacy of 
the DNA damage-based cancer therapy can be increased 
by inhibition of such repair pathways, and DNA repair 
enzymes represent a topic of an immense interest in this 
context (10-15). Apart from a few known inhibitors 
of DNA repair enzymes such as APE1 (16-20), PARP 
(21-23) or UDG (24,25), it may be assumed that small 
molecules, interfering with repair enzymes by binding to 
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their substrate, i.e. DNA lesion sites, could lead to modu- 
lation of DNA repair pathways and enhance of the 
anti-cancer activity of DNA-damaging agents. 

Along these lines, a few families of small molecules that 
selectively bind to DNA lesions, e.g. to mismatched base 
pairs in the DNA, have been described during the last 
decade. Among them, derivatives of naphthyridine have 
been shown to bind to G-G, G-A, A-A or C-C mis- 
matched base pairs by insertion into the duplex and for- 
mation of hydrogen bonds with the mismatched and 
neighboring bases (26-30). The solution structure of a 
complex of naphthyridine-azaquinolone dimer with an 
oligonucleotide having A-A mismatch in the CAG/CAG 
context was characterized by NMR; it features the binding 
of two molecules of ligand in the mismatch site, accom- 
panied by ejection of two flanking cytosine residues from 
the duplex (31). The second important class of mismatch 
binders is represented by bulky metal complexes, such 
as [Rh(bpy) 2 (chrysi)] 3+ and [Co(phen) 2 (HPIP)] 3+ , which 
do not form hydrogen bonds with the nitrogenous 
bases, but insert into mismatched and abasic sites taking 
advantage of their higher accessibility, as compared to 
the Watson-Crick base pairs (32-37). Structures of 
[Rh(bpy) 2 (chrysi)] 3+ bound to A-C and A-A mismatches 
were determined in the solid state using X-ray diffraction 
analysis (38,39), whereas the structure of a complex with a 
C-C mismatch was recently characterized in solution 
using NMR spectroscopy (40). Notably, in all three 
cases, the binding of the metal complex to the mismatch 
site leads to ejection of both mispaired bases out of the 
stack. However, intercalation of [Rh(bpy) 2 (chrysi)] 3+ 
between two well-matched base pairs was also observed 
in the solid state (38). Most importantly from the 
therapeutic point of view, mismatch-selective metal com- 
plexes inhibit the growth of mismatch-repair-deficient 
cancer cell lines, and not that of mismatch repair- 
proficient ones, which supports the hypothesis that recog- 
nition of base mismatches by small molecules and 
interference with the DNA repair enzymes is possible in 
cells as well (41,42). 

In another approach, we have shown that certain 
macrocyclic ligands containing two aromatic planes con- 
nected by two flexible polyamine chains (also termed 
cyclobisintercalators, or CBI) (43) can selectively bind to 
DNA lesions such as abasic sites (44,45) and homo- 
pyrimidine mismatches (46,47). The initial studies were 
performed with the 2,7-substituted bis-acridine (BisA) 
and 2,6-substitued bis-naphthalene (2,6-BisNP) 
macrocycles. Later, however, screening of a panel of 20 
homo- and heterodimeric macrocyclic ligands led to the 
identification of the 2,7-substituted bis-naphthalene 
isomer (2,7-BisNP, Scheme 1) as the ligand with the 
highest binding selectivity for mismatched versus fully 
matched DNA duplexes (48). The selective binding of 
this family of ligands to DNA lesion sites is, presumably, 
due to their peculiar constrained conformation and the 
short distance between the two aromatic planes, which 
does not allow their simultaneous intercalation into 
regular DNA, as it would violate the neighbor-exclusion 
principle (49). However, it appears that at sites of weaker 
thermodynamic stability, such as mismatches and abasic 




5'- Ci G2 T3 C4 G5 T6 A7 Gs Tg G10C11 -3' 

3- G22C21 A20 G19C18 T17 T16 C15 A14 C13 Gl2"5' 

Scheme 1. (a) Chemical structure and proton numbering scheme of 
2,7-BisNP. (b) Sequence and numbering of the TT-DNA duplex. 
The T-T mismatch is highlighted in boldface. 

sites, this principle can be violated, allowing the highly 
efficient binding of cyclobisintercalators. In fact, the 
binding of the cyclobisacridine BisA to a DNA duplex 
with an abasic site has been characterized by a 
high-resolution NMR and molecular modeling study, 
which showed that one of the acridine units of BisA 
occupies the abasic pocket, while the second one intercal- 
ates at the neighboring base pair step, in a threading 
bisintercalation fashion (45). Thus, we speculated that 
binding of 2,7-BisNP to T-T mismatch may proceed in 
a similar fashion. In addition, it was postulated that one 
of the mismatched thymines may be extruded (flipped out) 
from the helix, as this would be in agreement with their 
higher susceptibility to permanganate oxidation observed 
upon binding of CBI macrocycles (46). However, the 
structural details of the recognition event have not been 
available, so far. 

In this study, we report the NMR characterization and 
molecular modeling of 2,7-BisNP binding to an 11-mer 
oligonucleotide duplex containing a thymine-thymine 
mismatch target (TT-DNA, Scheme 1). The central part 
of TT-DNA (5'-TCG T AGT-3'/5'-ACT T CGA-3') corres- 
ponds to that of 17-mer duplex, used in our previous 
studies (46,46); the terminal GC base pairs were added 
to increase the thermal stability of the short duplex. 
Moreover, the asymmetry of the binding site with 
respect to the neighboring base pairs (G-C versus A-T) 
allowed us to hope that one type of ligand-DNA complex 
would be present exclusively, unlike in the case of 
BisA-abasic site recognition, where the symmetry of the 
binding site (two flanking G-C pairs) gave rise to two 
complexes (45). In addition, the solid-state conformation 
of 2,7-BisNP alone was obtained from X-ray diffraction 
analysis and served for comparison with the one observed 
upon binding to the DNA mismatch in solution. 

MATERIALS AND METHODS 

Solid-state structure of 2,7-BisNP 

The synthesis and characterization data of 2,7-BisNP x 
4HC1 were reported previously (48). Crystals suitable 
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for X-ray diffraction analysis were obtained by vapor 
diffusion of Et 2 0 or /-BuOMe into solutions of 
2,7-BisNP x 4HC1 in MeOH. A satisfactory dataset was 
obtained with a crystal grown form a MeOH-Et 2 0 
solvent pair. Diffraction data were collected by using 
a Bruker Kappa X8 APEX II diffractometer with 
graphite-monochromated Mofe radiation (X = 0.71073 
A) at 100 ± 1 K. The structure was solved by direct 
methods using SHELXS-97 and refined by full-matrix 
least-squares techniques using SHELXL-97 with aniso- 
tropic displacement parameters for all non-hydrogen 
atoms. X-ray data for 2,7-BisNP x 4 HC1 x 4 MeOH 
(CCDC-847671): CseHeoCLJ^Og, M = 786.68 gmol -1 , 
space group P2Jc (no. 14), a = 32.786 6(9), b = 5.524 80 
(10), c = 23.0566(6) A, a = y = 90°, p = 105.943 0(10)°, 
V = 4015.81(17) A 3 , Z = 4, fi = 0.342mm -1 , F(000) = 
1680, crystal size 0.20 x 0.1 1 x 0.06mm 3 . A total of 
98 941 reflections (1.29° < G< 33.30°) were collected, 
of which 15 469 were unique (R int = 0.0470) and 10 758 
observed [I>2a(I)] were used in calculations. The 
structure was refined on F 2 to R = 0.0820 [I>2a(I)] 
for 483 refined parameters and 6 restraints. Largest 
difference peak and hole 1.741 and —1.264 eA~ 3 . 

NMR studies 

Sample preparation. The oligonucleotides (purified 
by HPLC and ion exchange chromatography) were 
purchased from Eurogentec (Belgium). The two strands 
d(CGTCGTAGTGC) and d(GCACTTCGACG) were 
dissolved and mixed in 500 ul of 10 mM sodium phosphate 
buffer (pH 6) containing 50 mM NaCl, heated to 80°C and 
slowly cooled to room temperature to form the duplex. 
The sample was lyophilized once from water, twice from 
99.9% D 2 0 and then redissolved in 500 ul of 99.99% D 2 0. 
For NMR experiments involving exchangeable protons, 
the oligomers were dissolved in 90% H 2 O/10% D 2 0 
(v/v). The final duplex concentration was about 1.5 mM. 
For the 2,7-BisNP-TT-DNA complex, the ligand was 
dissolved in the NMR buffer at 19mM and added pro- 
gressively to the free DNA in the NMR tube until a 1:1 
complex was obtained, as confirmed by ID NMR 
spectroscopy. 

NMR experiments. 'H NMR experiments were carried 
out with a Varian Unity Plus 500 MHz spectrometer or 
a Varian Unity 800 MHz spectrometer equipped with a 
cryo-probe. Chemical shifts were calibrated relative to 
sodium 3-(trimethylsilyl)propionate-2,2,3,3-<i4 (TSP-t/ 4 ). 
Data were processed with VNMR software (Varian) and 
displayed and analyzed with the CCPNMR program (50). 

The NOESY spectra of the free DNA and of the ligand- 
TT-DNA complex were recorded in D 2 0 buffer at 10°C 
using mixing times of 75, 100, 150 and 250 ms. The 
TOCSY spectra were recorded with a spin-lock time of 
40 and 60 ms using a M LEVI 7 pulse train of 8 kHz for 
the free DNA and with a spin-lock time of 60 or 110 ms 
for the complex. The experiments were recorded with 512 
{t{) x 2048 (t 2 ) complex data points. Ninety-six scans were 
acquired over a spectral width of 5000 Hz (for experiments 
at 500 MHz) or 64 scans were acquired over a spectral 



width of 8000 Hz (for experiments at 800 MHz) in both 
dimensions. Water resonance was suppressed by a 
standard presaturation during the relaxation delay and 
during the mixing time for NOESY. The DQFCOSY 
spectrum was run with 2048 points in t 2 , 256 points in 
t\, and 96 scans for each /[ value. For a sample in 90% 
H 2 0, NOESY spectra were acquired at 10 or 5°C using 
the sculpting gradients scheme with selective pulses of 
3-4 ms for water suppression. The mixing time was 
set to 350 ms. The experiments were recorded with 
340 (ti) x 4092 (t 2 ) complex data points and 160 scans 
for each FID. ID proton-decoupled 31 P spectra and 
2D 'H- 31 P HSQC were recorded at 10°C at 
202 MHz; chemical shifts were referenced to trimethyl 
phosphate. 

For the free 2,7-BisNP, ID NMR spectra, 2D NOESY 
with mixing time of 0.8 s and DQFCOSY were performed 
in a D 2 0 buffer containing 10 mM sodium phosphate 
(pH6) and 50mM NaCl at 10° C. 

Distance restraints and dihedral angles. The NOE 
intensities measured from the NOESY spectra in D 2 0 
(mixing time of 150 ms) were converted into distance re- 
straints in the CCPNMR program and calibrated to the 
fixed reference cytosine H5-H6 = 2.45 A. The lower and 
upper boundaries were set to 0.8 and 1.2 multiples of 
calculated distances, respectively. For dihedral angles re- 
straints, a qualitative analysis of the DQFCOSY 
cross-peaks intensities was used to evaluate the approxi- 
mate values of the / H i'-H2'> Jht-hv and Jny-w coupling 
constants and deduce the pseudorotation angles P. The P- 
values were then used in the Antechamber module 
(AMBER 10.1 software, University of California, San 
Francisco) to generate the sugar pucker restraints. For 
the ligand-DNA complex, intermolecular NOEs between 
the ligand and DNA were measured and classified 
semi-quantitatively as distances into three categories: 
strong (2.0-3.0 A), medium (3.0-4.0 A) and weak 
(4.0-5.0 A). 

Molecular modeling 

The B-DNA duplex was built using the Biopolymer 
module of Biosym Insight II software, saved as a pdb 
file and exported into xLEaP module of the AMBER 
program. Sodium counterions were added. Structure cal- 
culations of the free DNA and of the ligand-TT-DNA 
complex were performed with the SANDER module of 
AMBER10.1 using the ff99SB force field. A 2fs time 
step was used for all the calculations, and the SHAKE 
algorithm was applied to remove bond stretching. The 
tetraprotonated 2,7-BisNP ligand was constructed in 
Insight II and minimized with the AMBER force field, 
which lead to a U-shaped conformation with a near- 
parallel orientation of naphthalene units. The partial 
charges were calculated using HF/6-31G* (Gaussian 03 
Rev. A.l, Gaussian, Inc., Pittsburgh, PA, USA) and the 
RESP module in AMBER. A new AMBER library con- 
taining the ligand parameters was then created. 

Structure of the free TT-DNA. Five random starting 
structures were prepared using 3-5 ps unrestrained 
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molecular dynamics at elevated temperature ranging from 
600 to 1000 K. Each of these structures was subjected to 
100 ps restrained simulated annealing protocol with 
implicit solvent (Born solvation model) using 232 experi- 
mental interproton distances, 100 sugar dihedral angles, 
91 backbone torsion angles which maintained the 
right-handed character of the helix and 27 angles which 
preserved base pairing as restraints. The temperature was 
rapidly increased to 400 K and maintained over the first 20 
ps. The system was then gradually cooled from 400 to 
100 K for 70 ps and finally to OK for the last 10 ps. The 
TAUPT parameter was set to 0.4 ps for the first 20 ps, and 
then during cooling it was increased to 4.0 ps. From 
90 to 95 ps, TAUPT was decreased to 1 ps and to 0.1 ps 
for the last 5ps. The weight of NMR restraints (REST 
parameter) increases from 0.1 to 1 during the first 10 ps 
(3.2 to 32kcalmol _I A -2 for distance restraints and 
50kcalmol _1 rad -2 for torsion angle restraints) and kept 
at this value until the end of the simulation. The five final 
structures were averaged and solvated in a periodic TIP3 
water box that extended to a distance of 5 A from any 
solute atom (approximate dimensions 35 x 35 x 50 A, 
1766 water molecules). One thousand steps of minimiza- 
tion of the solvent and counterions only were applied; the 
whole system was then subjected to 5000 steps of mini- 
mization. A 20 ps NVT dynamics were then performed at 
300 K on the water only (with the DNA and sodium ions 
weakly constrained) before 100 ps of NPT dynamics at 
300 K were performed on the whole system without 
NMR restraints, and then 100 ps of NPT dynamics at 
300 K with NMR restraints. The structures over the last 
5ps were analyzed. The structures have been deposited to 
the Protein Data Bank (code 2LL9). 

Structure of the 2,7-BisNP-TT-DNA complex. The struc- 
ture of the free TT-DNA obtained from the above 
protocol was modified in the Biopolymer module of 
Biosym Insight II software to create an intercalation site 
at the A 7 pGs and CispT^ steps. The backbone angles of 
the T 6 pA 7 pG8 and Ci5pTi 6 pT 17 steps were manually 
changed, and T 6 and T 17 were pushed toward the major 
groove. 2,7-BisNP was manually docked into the DNA. 
The complex was solvated with a new periodic TIP3 
water box extended to a distance of 4.5 A from any 
solute atom (approximate dimensions 40 x 40 x 55 A, 
2400 water molecules). One thousand steps of minimiza- 
tion of the solvent and counterions only were applied, 
and then the whole system was subjected to 5000 steps 
of minimization using NMR restraints with a force 
constant of 32kcalmol _1 A -2 . Then 20 ps of equilibration 
followed by 40 ps of simulated annealing molecular 
dynamics were performed. No force constants were 
introduced to ensure the A 7 -T 16 base pairing. The final 
structure has been deposited to the Protein data bank 
(code 2LLJ). 

DNA melting experiments 

Thermal denaturation studies were performed in triplicate 
according to the previously publishing protocol in lOmM 
sodium cacodylate buffer containing 50 mM NaCl, pH 6 



(48). The concentration of duplex TT-DNA or TA-DNA 
(5'-CGT CGT AGT GC-3'/5-GCA CTA CGA CG-3') 
was 2 uM, and the concentration of ligand was varied 
from 0 to 8 uM. 

RESULTS 

Structure of 2,7-BisNP in the solid state 

The preliminary X-ray diffraction data obtained with 
several crystals grown form MeOH-Et 2 0 or MeOH- 
/-BuOMe solvent pairs indicated the same space group 
and unit cell volume, which provided evidence of the iden- 
tical structure. A detailed analysis was performed with a 
crystal of 2,7-BisNP x 4 HC1 x 4 MeOH grown from 
MeOH-Et 2 0. Two crystallographically non-equivalent 
cations of the macrocycle with similar geometric param- 
eters were observed in the asymmetric unit. The structure 
(Figure la and b) reveals the non-overlapping, 
anti-parallel orientation of the naphthalene units within 
each cation. The planes of the aromatic units are 
separated by a distance of 6.6-6.7 A. Two molecules of 
MeOH are inscribed in the macrocycle, hold by the 
hydrogen bonds between the OH groups, chloride ions, 
and the protonated amino groups of the side chains of 
2,7-BisNP, as well as by van der Waals contacts between 
the methyl groups and naphthalene units. Two other 
solvent molecules and chloride ions reside in the intermo- 
lecular cavities (Figure lc). Importantly, the naphthalene 
units do not participate in intra- or intermolecular 
7i-stacking interactions. The structure is hold together by 
a network of hydrogen bonds between chloride anions, 
methanol molecules and protonated amino groups of the 
side chains. 

Although the anti-parallel conformation of 2,7-BisNP 
in the solid state does not feature a confined space between 
the naphthalene units, it may be expected that in solution 
the macrocycle is more dynamic giving rise to numerous 
possible conformers. Indeed, a molecular dynamics study 
of 2,7-BisNP in a water box identified several conformers 
differing mainly by a more or less open orientation of their 
naphthalene units (48). In addition, in a previous X-ray 
structural study, the amphi isomer (2,6-BisNP) was found 
to adopt a semi-closed conformation with a parallel orien- 
tation of the two aromatic units upon binding of an 
aromatic dicarboxylate guest (51). It is therefore clear 
that the conformation of the macrocycle, free or in a 
complex, is influenced both by the attachment position 
of the two linkers on the aromatic moieties (48) and by 
the nature of the guest trapped inside the macrocyclic 
scaffold, which points out to the ability of this class of 
macrocycles to adapt to a large variety of guests. 

DNA melting experiments 

To confirm the selective binding of 2,7-BisNP to the 
1 1-mer TT-DNA and get a rough idea of the binding stoi- 
chiometry and affinity, thermal denaturation experiments 
were performed at variable ligand-to-duplex ratios. In 
conditions comparable to those of the NMR experiments 
([Na + ] = 60mM, pH6), the duplex TT-DNA alone 
denatured at T m = 33.0°C. The addition of 2,7-BisNP 
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Figure 1. Structure of 2,7-BisNP x 4HC1 in the solid state, (a) ORTEP 
view along the crystallographic b axis of 2,7-BisNP with the enclosed 
solvent molecules and anions. The thermal ellipsoids of the 
non-hydrogen atoms are shown with 30% probability. The oxygen 
atoms of the side chains occupy two alternate positions; the ones 
with the higher occupancy factor (0.83) are shown, (b) ORTEP view 
along the crystallographic a axis without solvent molecules and anions, 
(c) Crystal packing (CPK coloring scheme). 
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Figure 2. Melting temperatures of TT-DNA (filled circles) and 
TA-DNA (empty cirlces) in the absence and in the presence of 
2,7-BisNP (0-4 equivalents). Conditions: f dna = 2 uM in cacodylate 
buffer, pH6.0, [Na + ] = 60 mM. 



led to significant stabilization of TT-DNA, as demon- 
strated by large increase of the melting temperature 
(Figure 2). The maximal effect was observed at a 1:1 
ligand-to-DNA ratio (AT m = 13.3°C), whereas at higher 
ligand loadings, the stabilization effect leveled off and 
slightly decreased to 12.3°C. This minor decrease is attrib- 
utable to weak binding of the excess ligand to the 
single-stranded form, which may slightly counterbalance 
the strong stabilizing effect induced by duplex binding. At 
a ligand-to-DNA of 0.5:1, a broad melting curve was 
observed, indicative of incomplete saturation of the 
binding site (52). 

To confirm the selective binding of 2,7-BisNP to mis- 
matched DNA, the same experiment was performed with 
the fully paired duplex TA-DNA. The denaturation 
profile of TA-DNA was hardly influenced by the 
presence of 2,7-BisNP (Figure 2), and only at high 
ligand loadings, a decrease in melting temperature was 
observed (AT m = -2.4°C at a 4:1 ligand-to-DNA ratio), 
indicative of weak binding to the single DNA strands 
as described for analogous macrocycles (53,54). 
Interestingly, melting temperature of the free duplex 
TA-DNA (T m = 45.0°C) is very close, within the experi- 
mental error, to the melting temperature of TT-DNA in 
the presence of one molar equivalent of ligand (46.2°C), 
indicating that binding of 2,7-BisNP leads to stabilization 
of the T-T-mismatched duplex to the level of a regular 
T-A duplex. Altogether, thermal denaturation experi- 
ments speak for the preferential binding of the macrocycle 
at the T-T base pair site. 

Structure of TT-DNA in the absence of ligand 

Proton and 31 P assignments. The base and sugar protons 
of TT-DNA were identified on the basis of analysis of 
2D NOESY, TOCSY and DQFCOSY spectra using the 
sequential assignment strategy, well described for 
right-handed DNA duplexes (55). All cross-peak 
intensities were consistent with a B-like DNA conform- 
ation and the bases adopting an a«?/-conformation. 
Qualitative analysis of the proton-proton coupling con- 
stants in the DQFCOSY spectra (no detectable H3-H4' 
and H2"-H3' cross-peaks corresponding to very small 
Jh3'-u4' and /h2"-h3' coupling constants, but characteristic 
Hl'-H2' pattern) along with intensities in the NOESY 
spectra indicate that all the sugars, except Q, Ci 5 and 
G 2 2, are in a C2'-endo conformation. For these sugars, a 
P-value in the range 162-180° was used. Q, Ci 5 and G 2 2 
showed large Zh3'-h4' coupling constant and small but de- 
tectable /h2"-h3s corresponding to a pseudorotation phase 
angle P ranging from 90 to 120°. For the others sugars, a 
P- value in the range 162-180° was used. 

Around the mismatched base pair T 6 -T 17 , the bases are 
well-stacked and both T 6 and T 17 remain inside the helix, 
as confirmed by the sequential NOE connectivities 
observed between T 6 H6 and G 5 H1', G 5 H2' and G 5 H2", 
A 7 H8 and T 6 H1', T 6 H2' and T 6 H2", T 17 H6 and T 16 H1', 
T 16 H2' and T 16 H2", C 18 H6 and T 17 H1', T 17 H2' and 
T 17 H2", as well as characteristic H8/H6 cross-peaks 
(G 5 H8/T 6 H6, T 6 H6/A 7 H8, T 16 H6/T 17 H6, T 17 H6/d 8 H6). 
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' 1 ' 1 [ ' 1 ' 1 I ' ' ' 1 I 1 1 1 1 1 ' ' ' ' I ' ' 1 1 1 ' ' 1 ' I ' 1 ' ' 

14.0 13.5 13.0 12.5 12.0 11.5 11.0 10.5 ppm 

Figure 3. ID imino spectra of (a) free TT-DNA at 10°C and (b) 2,7-BisNP-TT-DNA complex at 5°C. T| 6 and G5, which undergo most significant 
shifts, are highlighted in bold. 



The exchangeable imino protons were assigned with the 
ID and 2D NOESY spectrum recorded in 90% H 2 0 at 10 
or 5°C. The ID spectrum shows three main sets of imino 
proton: thymine imino protons in the range 13.5— 

14.0 ppm, guanine imino proton in the range 12.8— 

13.1 ppm and two resonances at 10.56 ppm and 
10.61 ppm, corresponding to the mismatched thymines 
(Figure 3a). It is known that non-hydrogen-bound imino 
protons, such as those of bulged bases (56), bases facing 
abasic sites (57), or non- Watson-Crick base pairs like base 
mismatches (58-62) or wobble base pairs (63,64) resonate 
in this range. These resonances were assigned to the imino 
protons of T 6 and T 17 , respectively, based on the 
cross-peaks observed in the 2D spectra. However, it was 
not possible to deduce the hydrogen bonding pattern 
between T 6 and T 17 , as the imino protons of these bases 
were almost coincident. Nevertheless, we can conclude 
that the T-T mismatch does not introduce dramatic dis- 
tortions to the duplex structure, as chemical shifts of the 
DNA protons are in the common range observed for 
regular DNA. 

31 P chemical shifts were used to obtain information 
about the conformation of the phosphodiester backbone, 
as they are sensitive to the local distortions. The ID 31 P 
spectrum was recorded at 10°C (Supplementary Figure 
SI), and the signals were assigned from a 'H- 31 P HSQC 
experiment (Supplementary Table SI). The 31 P chemical 
shifts were observed in the range from —3.72 to 
— 5. 14 ppm, clearly indicating that all torsion angles are 
in the BI conformation (65). 

Structural modeling of the free TT-DNA. The structural 
analysis of the free TT-DNA was carried out using a 
simulated annealing protocol within the AMBER 10.1 
program. A set of five random conformations was 
generated to ensure that each of the starting structures 
was in different regions of the conformational space and 



that the final conformation is independent of the initial 
structure. A 100 ps restrained dynamics in implicit water 
were then performed followed by 100 ps of restrained 
dynamics in explicit solvent (see Materials and 
Methods). An ensemble of the structures obtained over 
the last 5 ps of the dynamics is shown in Figure 4. The 
DNA adopts a regular B-DNA-like structure with 
well-formed Watson-Crick alignments of all canonical 
base pairs. The bases T 6 and T 17 remain inside the helix, 
and the major and minor groove size are normal around 
the mismatch. Thus, the T-T mismatch does not introduce 
dramatic distortion to the structure, in accordance with 
what has been observed in earlier NMR studies (58-62). 
A good agreement with the NMR experimental restraints 
was obtained: 13 restraints were violated by 0.2-0.3A, 14 
by 0.3-0.5 A and four by more than 0.5 A. The mutual 
root mean square deviation (rmsd on all atoms except the 
terminal base pairs) between the last five structures from 
the dynamic are in the range 0.32-0.39 A, evidencing no 
major fluctuations between the structures and a good con- 
vergence. This successful convergence to similar conform- 
ation is indicative of the performance of the restrained 
molecular dynamics. Most significant differences are 
observed at the terminal base pairs and can probably be 
attributed to end fraying. 

Structure of the 2,7-BisNP-TT-DNA complex 

The titration of 2,7-BisNP to TT-DNA was monitored by 
ID NMR spectroscopy (Supplementary Figure S2). 
Upon the addition of the ligand new resonances 
appeared and increased while the resonances of the free 
oligonucleotide decreased, disappearing completely when 
a 1:1 ratio was reached. This indicates a slow exchange 
regime on NMR timescale. Line broadening was also 
observed, and some of the aromatic protons of the 
ligand were significantly shifted upfield. These 
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Figure 4. Structure of TT-DNA: superposition of the five last struc- 
tures emergent from the restraint molecular dynamics viewed (a) from 
the side and (b) into the major groove. The mismatched thymine 
residues T 6 and T 17 are depicted in green, hydrogen atoms are not 
shown, (c) Close-up down view of the T 6 -Ti 7 mismatch (O: red, N: 
blue, H: white, P: orange). 

observations are consistent with a specific binding and 
insertion of the ligand inside the DNA. 

Proton and 31 P assignments of the 2,7-BisNP-TT-DNA 
complex. Most of the non-exchangeable protons of the 
oligonucleotide were assigned using a combination of 
2D NOESY, TOCSY and DQFCOSY. In the NOESY, 
the observed sequential internucleotide connectivities 
H6/8 and HI' as well as H6/8 and H2'/H2" allowed the 
assignment of most of the protons. However, this sequen- 
tial walk was interrupted: no NOEs were observed 
between G 8 H8 and A 7 -sugar protons and between 
T 16 H6 and Ci5-sugar protons, which gives a clear 
evidence of intercalation of an aromatic unit between 
the base pairs A 7 -T 16 and G 8 -Ci 5 , as these NOEs were 
clearly present in the free oligonucleotide. Adenine A 7 
was difficult to identify, as its signal was not observed in 
the typical range of chemical shifts normally observed for 
adenine (downfield), but was significantly shifted upfield 
(8 = 7.88 ppm) and overlapped with five H8 or H6 base 
protons. This also undoubtedly points out to binding of 
the ligand in the neighborhood of A 7 . 

Weak NOE signals were observed between T 6 H6 and 
Gs-sugar protons, but not between Ti 7 H6 and Ti 6 -sugar 
protons. Similarly, no NOE were present between C[ 8 H8 
and T 17 -sugar protons. Unfortunately, it was not possible 
to unambiguously determine if A 7 H8 gave NOE with 



T 6 -sugar proton, as the peak of A 7 H8 was strongly 
overlapped with other protons. These observations 
suggest ejection of the mismatched thymine T 17 from the 
double helix, with the opposite T 6 remaining closer to its 
flanking bases G 5 and A 7 than T 17 does to Q 8 and T 16 . 
The intranucleotide NOEs indicate that even after binding 
of 2,7-BisNP to DNA, the bases maintain their anti-con- 
formation. The DQFCOSY cross peak pattern suggests 
that the sugars mainly adopt the C2'-endo puckering, 
even the thymines T 6 and T 17 , as it was observed for the 
oligonucleotide in the absence of the ligand. 

The imino and amino protons were assigned on the 
basis of the 2D-NOESY spectrum of the complex 
recorded in H 2 0 at 10 or 5°C. Usual correlations for 
Watson-Crick base pair were observed between G-NH 
imino protons and C-NH 2 amino protons and between 
T-NH and A-H2 protons. Imino-imino NOEs between 
flanking base pairs were also detected throughout the 
duplex. G 5 NH and T 16 NH were markedly shifted upfield 
(1.5 ppm and 1.0 ppm, respectively), which gave evidence 
for intercalation of 2,7-BisNP at the G 5 pT 6 and C 15 pT 16 
steps. The T 6 NH and T 17 NH were easily detected in the 
ID spectrum as broad signals at 10.49 and 10.56 ppm 
(Figure 3b), but could not be unambiguously assigned 
even in the 2D spectra due to the absence of correlation 
peaks. This is consistent with the fact that they are 
extruded from the duplex and more exposed to the 
solvent. 

The protons of 2,7-BisNP were assigned following 
the analysis of the NOESY, TOCSY and DQFCOSY 
datasets in D 2 0 solution. The aromatic region of the 
DQFCOSY spectrum displayed four sets of vicinal 
coupling connectivities corresponding to H1/H2, H3/H4 
and H7/H8, H9/H10. The protons HI, H4, H7 and H10 
were identified on the basis of their long-range coupling 
constant with H5, H6, Hll and H12, respectively, and 
their NOE with the methylene proton of side chains 
(H19, H13, H24 and H18). The other methylene protons 
of the linker chains were identified by combined use of 
TOCSY and NOESY cross-peak. Unfortunately, unam- 
biguous assignment of all the protons of the side chains 
was not possible due to broad signals and severe overlaps. 
Upon binding to the DNA, the methylene protons become 
non-equivalent and have different, but very similar, chem- 
icals shifts leading to numerous overlapping signals which 
were difficult to assign. 

Finally, the 31 P spectra of the 2,7-BisNP-TT-DNA 
complex (Supplementary Figure SI) show similar 
chemical shift dispersion compared to the free TT-DNA, 
indicating no major backbone deformation upon binding 
of the ligand. 

Chemical shift mapping upon binding of 2,7-BisNP to TT- 
DNA. Changes in the chemical shift upon binding repre- 
sent a valuable tool for the localization of the binding site 
of the ligand. The interaction of 2,7-BisNP with the 
TT-DNA duplex resulted in line broadening and signifi- 
cant variation of chemical shifts relative to the free DNA 
(Figure 5). As can be seen, the central part of the sequence 
T 6 pA 7 pG 8 previously identified as the possible binding site 
undergoes the most pronounced changes. Indeed, most of 
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Figure 5. Changes in 'H chemical shifts of TT-DNA (AS = 
Abound — <5f ree ) upon binding of 2,7-BisNP. Only significant changes 
(| A£| > O.lOppm) are shown. 



the protons of the base pairs, which flank the T 6 Tj 7 
mismatch are shifted upheld due to the intercalation of 
the ligand and the associated 7i-stacking effect. For 
example, A 7 H8 is shifted upheld by 0.41 ppm, T[ 6 H6 is 
shifted by 0.31 ppm, T 6 CH3 is shifted by 0.46 ppm and 
almost all the protons of Ci 5 are shifted upheld. The 
most striking effect is observed for the exchangeable 
protons: G 5 NH is shifted by —1.37 ppm, T 16 NH by 
-0.97 ppm, G 8 NH by -0.38 ppm and C, 8 NH 2 protons 
are shifted by —0.63 and —0.1 8 ppm. This provides a 
strong evidence of the insertion of the ligand into DNA. 
The large shift experienced by G 5 NH and, to a lesser 
extent, by C i8 NH 2 together with the shift of the 
non-exchangeable protons of A 7 -T 16 pair clearly reveals 
the position of one of the aromatic ring of 2,7-BisNP 
inside the helix in the plane of T 6 -T 17 . Indeed, such shift 
would be inconsistent with an intercalation occurring 
between the base pairs T 6 -T 17 and A 7 -T 16 . 

The signals of some protons were, instead, shifted 
downfield. Notably, most protons of the mismatched 
thymines T 6 and especially T 17 exhibited higher chemical 
shifts compared with the free TT-DNA, providing 
evidence of their ejection from the base stack. No 
significant changes (|A8| > O.lOppm) were detected for 
the base pairs other than C 4 -G 19 , G 5 -C 18 , T 6 -T 17 , 
A 7 -T 16 , G 8 -Ci5 and T 9 -Ai 4 , which speaks for the 
localization of the binding site in the central part of the 
duplex. The signals of the protons of 2,7-BisNP also 
undergo significant upheld shifts (A<5 up to —1.36 ppm; 
Table 1) which are characteristic of its intercalation into 
the DNA. 

(3) Intermolecular NOEs between 2,7-BisNP and 
TT-DNA. The assignment of the ligand and DNA 
protons provided the basis for identification of the 
ligand-DNA contacts, which gave insight into the mode 
of binding of 2,7-BisNP. Twenty-three specific intermo- 
lecular ligand-TT-DNA contacts (Table 1) were detected 
in the 2D NOESY spectrum (Figure 6), which allowed, in 



combination with the observations described previously, 
to identify more precisely the location of the binding site 
and the binding mode. Numerous NOE contacts were 
observed between the two aromatics cores of 2,7-BisNP 
and the Ti 6 base, consistent with the 'sandwiching' of T 16 
between the aromatic planes of the ligand (Figure 7). 
Some NOEs were also detected between one naphthalene 
ring (protons HI and H2) and the sugar protons of T 17 . 
NOEs between A 7 Hl'and the side-chain protons CH 2 -24 
indicate that this chain lies in the minor groove. On the 
contrary, the NOE observed with the other side chain, 
namely between CH 2 -18 and T 16 CH 3 , requires the 
second chain to be placed in the major groove. This is 
only possible if the binding of 2,7-BisNP occurs via a 
threading intercalation mode, with the linking chains oc- 
cupying both the minor and the major grooves. The NOE 
cross-peaks between H5 and H6 of the ligand and the 
imino proton of the guanine (G5NHI) as well as 
between H5 and the cytosine (Ci 8 H5) flanking the 
mismatch site give another evidence of insertion of the 
ligand at the level of T 6 -T 17 base pair. 

Structure of the 2,7-BisNP-TT-DNA complex from 
restrained molecular dynamics calculations. The 23 
intermolecular distance restraints determined from the 
NOESY spectra were used for molecular dynamics calcu- 
lation in order to obtain a model of 2,7-BisNP binding to 
TT-DNA that would agree with NMR data. The structure 
obtained after 40 ps of dynamics in explicit water is dis- 
played in Figures 7 and 8. It shows the threading of the 
macrocyclic ligand through TT-DNA and the specihc 
binding around the central thymine-thymine mismatch 
site. One of the naphthalene rings of 2,7-BisNP faces T 6 
and occupies the place of T17, which is extruded toward 
the major groove. This naphthalene is almost perpendicu- 
lar to the main axis of A-^Tig base pair, which allows a 
good stacking with T 16 , but not with A 7 (Figure 8c). The 
T 6 remains stacked with the flanking G 5 residue. The 
sandwiched base pair A 7 -T 16 forms a buckle of around 
38° to allow the ligand to snugly ht inside the helix. The 
second naphthalene ring of 2,7-BisNP also stacks with 
T 16 , leading to the 'sandwiching' of the latter in the 
cavity delineated by the two naphthalene units. The side 
chains of 2,7-BisNP are located in the middle of each 
groove. Stabilization of the complex occurs, in addition 
to the aforementioned jr-stacking interactions, due to for- 
mation of the hydrogen bonds (Figure 8d) between the 
protonated amino groups of the ligand and carbonyl 
groups of thymines T 6 and T 16 (C13-NH 2 + and T 6 04, 
C19-NH 2 + and T 16 02, C24-NH 2 + and T 16 02). In particu- 
lar, the hydrogen bond with T 6 may explain the specihcity 
of the ligand for T-X mismatch, while the two hydrogen 
bonds with the 'encapsulated' thymine T 16 largely contrib- 
ute to the stabilization of the complex. In addition, four 
hydrogen bonds between the positively charged amino 
groups and water molecules were detected (66). One 
of the water molecules mediates the bonding between 
C18-NH 2 + and T 16 04, while the other one mediates the 
interaction between C19-NH2 and two phosphate groups 
(pTi 6 p)- 
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Table 1. 


'H chemical shifts (ppm) of 2,7-BisNP in 


the free form and in 


the 2,7-BisNP-TT-DNA complex and observed intermolecular NOEs" 


Proton b 


°iree 


Si. 

u bound 


AS C 


2,7-BisNP-TT-DNA NOEs 


HI 


7 A2 


A 8A 
0.00 


-0.57 


T 16 HT (m), T 16 CH 3 (w), T 17 HT (m), T 17 H2' (m). T 17 H2" (m) 


H2 


7 Q 1 


A 77 
0. /Z 


-1.09 


T 16 H1' (w), T 16 CH 3 (m), T 17 H1' (s), T 17 H2' (s), T I7 H2" (m) 


H3 


7 8 1 
/ .0 1 


7 1 ^ 
/ . 1 J 


-0.66 




H4 


7 


7 7^ 
/ .Z J 


-0.18 




H5 


7 78 


O.OJ 


-1.19 


G 5 NH1 (m), C 18 H5 (w) 


H6 


7 78 


0.3V 


-1.13 


G5NHI (m) 


H7 


7 A3 


O. J7 


-0.84 


T 16 H1' (s), T 16 H2' (s), T 16 H2" (s), T 16 CH 3 (m) 


H8 


7 8 1 


A £7 
0.0 / 


-1.14 


T 16 HT (m), T 16 H2' (m), T 16 H2" (m), T 16 CH 3 (m) 


H9 


7 8 1 
/ .5] 


7 1 ^ 
/ . 1 J 


-0.66 




H10 


7 A2 


7 1 7 
/ . 1Z 


-0.31 




Hll 


7 78 


^. 8£ 
O.oO 


-0.92 




H12 


7 78 


0.4Z 


-1.36 




CH 2 -13 


4.ZZ 




-0.15/-0.06 




CH 2 -14 


3.4U 


MA 


NA 




CH 2 -15 


3.88 


3.44/3.42 


-0.44/-0.46 




CH 2 -16 


3.88 


NA 


NA 




CH 2 -17 


3.40 


NA 


NA 




CH 2 -18 


4.22 


4.07 


-0.15 


T 16 CH 3 (w) 


CH 2 -19 


4.22 


3.93 


-0.29 




CH 2 -20 


3.40 


NA 


NA 




CH 2 -21 


3.88 


NA 


NA 




CH 2 -22 


3.88 


3.22 


-0.66 




CH 2 -23 


3.40 


3.43 


0.03 




CH 2 -24 


4.22 


3.89/4.07 


-0.33/-0.15 


A 7 HT (m) 



"Chemical shifts at 10°C. 

b For the (arbitrary) proton numbering of 2,7-BisNP, see Scheme 1. 
AS = Abound — <5 free . 

s, strong; m, medium; w, weak; NA, Not assigned. 



DISCUSSION 

Thermal denaturation data strongly suggest the formation 
of a complex with a 1 : 1 stoichiometry upon binding of the 
bis-naphthalene macrocycle to the 1 1-mer duplex contain- 
ing a TT mismatch. Accordingly in the NMR-monitored 
titration, clean formation of the ligand-DNA complex 
was observed at a 1:1 ratio and no excess of the ligand 
was necessary. Finally, ID and 2D NMR data give 
evidence that only one type of complex between 
2,7-BisNP and TT-DNA is formed. 

The binding model, deduced from the numerous ligand- 
DNA NOE interactions, indicates that the macrocycle is 
located inside the double helix and sandwiches the 
thymine Ti 6 of the A-T base pair in the vicinity of the 
T 6 -T 17 mismatched base pair. Presumably, the weaker sta- 
bility of the A 7 -T 16 base pair compared with the G 5 -C 18 
pair which flanks the T-T mismatch on the other side, 
leads to formation of a single complex. It is likely that 
the hydrogen bonding between A 7 and T 16 is weakened 
due to the strong kink (38°) observed between the planes 
of these bases. The two polyammonium linkers of the 
macrocyclic scaffold lie both in the minor and the major 
grooves in a threading mode. In addition, the binding 
features two hydrogen bonds with the encapsulated 
thymine (Ti 6 02) and one hydrogen bond with one of 
the mismatched thymines (T 6 04). Both mismatched 
thymines are strongly displaced as compared to their 
position in the free duplex, especially T ]7 which is 
flipped out of the duplex toward the major groove. 
Thus, binding of the macrocycle induces a local distortion 



of the mismatch site, and interactions are observed 
over the four base pairs from C5-G18 to Gs C15. 
Altogether, the ligand-DNA complex results from a 
complex array of multiple interactions involving the mis- 
matched base pair and the three base pairs in the vicinity, 
which is based on hydrogen bonding, electrostatic inter- 
actions and Tr-stacking. It is clear that the macrocycle 
takes benefit of the local thermodynamic instability of 
the mismatch site and of the neighboring base pairs to 
enter inside the DNA and accommodate a thymine ring 
in its cavity. The binding of the ligand necessitates firstly 
the disruption of the Watson-Crick base pairing between 
A 7 and T 16 and of the wobble pairing between T 6 and Ti 7 . 
These events should be energetically compensated by the 
final stability of the complex, as evidenced by the large 
increase in melting temperature. In this regard, the three 
hydrogen bonds established between the ammonium 
groups of the ligand linkers and the carbonyl groups of 
the two thymine bases (T 6 and T J6 ) should provide signifi- 
cant contribution. Additionally, partial restoration of 
hydrogen bonding between A 7 and T 16 , which is likely 
(although not evidenced by NMR), may contribute to sta- 
bilization of the complex. Another important contribution 
is due to strong electrostatic interaction of the highly 
charged (+4) ligand and DNA phosphate backbone, 
which is highly favorable since both ligand linkers 
occupy the grooves of the DNA. We can also anticipate 
that the entropic factors also largely participate to the 
stabilization effect, since, first, binding of the ligand 
leads to release of water molecules from the DNA 
grooves and from the solvation sphere of the ligand and, 
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Figure 6. Expansion of the 250 ms NOESY spectrum of the 2,7-BisNP-TT-DNA complex in D 2 0 at 10°C. Correlations are between the aromatic 
protons and the sugar protons. Selected intermolecular 2,7-BisNP-TT-DNA cross-peaks are indicated. 




Figure 7. Structure of the complex of 2,7-BisNP (purple) with TT- 
DNA (CPK colors) from NMR-restrained molecular dynamics calcu- 
lations showing the binding site and the NOEs observed in NMR 
spectra (black lines). 



second, the ligand is preorganized due to its macrocyclic 
nature, which means low entropy cost for adaptation to 
the binding site. 

The cyclobisintercalation or pseudocatenation binding 
mode observed in the present study is remarkable in many 
aspects. 



(1) This very particular and unusual DNA binding 
behavior is evidenced here for the second time with 
a CBI macrocycle and a DNA duplex containing a 
single-base pairing defect (abasic site or a mismatch). 
Indeed, in a previous study, we have shown that a 
macrocyclic bis-acridine BisA binds via a similar 
threading double intercalation mode to an apurinic 
site with encapsulation of a pyrimidine base 
(cytosine) of the G-C base pair flanking the lesion 
(45). However, in that case, two complexes in an 
85:15 ratio were observed: in the major complex, 
both acridine units were intercalated, sandwiching 
the G-C base pair flanking the lesion, whereas in 
the minor complex only one acridine was inserted 
in the abasic pocket, the location of the second one 
being not defined. This is different in the present 
study since the asymmetry of the base pairs 
flanking the mismatch site (A-T versus G-C) led to 
formation of only one complex, through the encap- 
sulation of the more labile A 7 -T 16 base pair. Thus, 
this very particular binding mode involving double 
threading of the double helix at a base pairing 
defect with encapsulation of the neighboring base 
pair seems to be a unique feature of the CBI macro- 
cycles. Nonetheless, it should be noted that threading 
bisintercalation has also been characterized in the 
case of a macrocyclic bis-naphthalenediimide (67) 
or proposed in the case of a macrocyclic bis- 
acridine (68,69). However, both derivatives were 
built with two long linker chains (> 10 A) and were 
found to bind to regular double-stranded DNA with 
spanning over 2-4 bp. This binding behavior is in 
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Figure 8. Threading intercalation binding mode of 2,7-BisNP (purple) 
to TT-DNA from NMR-restrained molecular dynamics calculations, 
viewed (a) from the major groove and (b) from the side of the 
ftipped-out Ti 7 . (c) Top view of the naphthalene rings of 2,7-BisNP 
stacking with the encapsulated thymine T 16 . (d) Close-up view of the 
binding site (view from the minor groove, only heteroatom-bound 
hydrogen atoms are shown) showing direct and water-mediated 
hydrogen bonds (cyan sticks) between 2,7-BisNP and TT-DNA. 



On the one hand, this short distance prevents intercal- 
ation at contiguous Watson-Crick base pairs for steric 
reasons (neighbor-exclusion principle) (49); on the 
other hand, it is ideally suited for the encapsulation 
of aromatic guests, particularly nucleosides and nu- 
cleotides (70). This structural particularity governs 
both the binding preference for the T-T mismatch 
over the regular duplex and the sandwiching of the 
neighboring pyrimidine base (T 16 ) by 7i-stacking inter- 
actions. 

Interestingly, the short distance between the two 
naphthalene planes is roughly the same in the solid 
state (6.6-6.7 A). Conversely, the X-ray analysis indi- 
cates a Z-shaped conformation with antiparallel orien- 
tation of the two aromatic units, which differs from the 
almost parallel orientation of these units (U-shaped 
conformation) in the DNA-bound macrocycle as 
shown by the NMR analysis. This indicates a certain 
degree of conformational flexibility of the macrocyclic 
scaffold, resting mainly on the free rotation about the 
C ar — C and C— N bonds between the aromatic units 
and the spacers, and is consistent with previous data 
obtained from the molecular dynamics study of the 
free 2,7-BisNP in a water box (48). This is an 
evidence of the conformational adaptability of the 
bis-naphthalene macrocycle that enables fitting with 
its DNA target and is likely a key factor of the 
binding interaction. 
(3) Finally, the observed binding mode sheds light on 
the high structural deformability (plasticity) of 
duplex DNA which, albeit well-known, is often 
underestimated when designing DNA-interactive 
agents. In this regard, a comparison with the two 
other classes of mismatch-binding agents, i.e. naphth- 
yridine derivatives (31) and bulky metal complexes 
(38-40), shows that in all cases binding proceeds 
via ejection of one of the mismatched bases, as in 
the present study. Thus, all three families of ligands 
take advantage of the local thermodynamic instabil- 
ity of the mismatch sites, compared to Watson-Crick 
base pairs. However, the uniqueness of the CBI series 
is represented by their two-point binding mode (mis- 
matched base pair and intercalation pocket at the 
neighboring base pair step) and their conformational 
restrictions, which do not allow binding to fully 
paired DNA regions. In summary, the observed 
threading interaction is thus the result of a subtle 
interplay between the conformational flexibility of 
the macrocyclic ligand and the local dynamics of 
the DNA mismatched site. 



stark contrast with that of our macrocyclic bis-naph- 
thalene that does not bind fully paired duplex DNA. 
(2) As we anticipated (46-48), the characteristic DNA 
binding behavior of cyclobisintercalators studied by 
our group is obviously determined by the short 
length of the polyammonium linkers that imposes a 
constrained and short distance between the two 
aromatic moieties. Indeed, an interchromophoric 
distance of ~6.7-6.8 A can be measured for 
2,7-BisNP bound to the T-T mismatched duplex. 



CONCLUSIONS 

In the present study, we characterized the structural 
details of a bis-naphthalene macrocycle bound to a T-T 
DNA mismatch via a rare cyclobisintercalation binding 
mode, which involves encapsulation of the neighboring 
thymine base between the aromatic planes of the macro- 
cycle and ejection of one of the mispaired thymines. 
This binding mode is unique since it provides high speci- 
ficity for DNA base pairing defects with respect to 
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regularly paired DNA, and allows recognition of unpaired 
thymine due to the hydrogen bonding pattern of the 
2,7-BisNP ligand. Interestingly, the structure of the 
ligand-DNA complex obtained in the present study 
shows that the macrocyclic scaffold of the ligand allows 
the introduction of additional chemical functionalities, 
either at the linking chains or in the positions of the naph- 
thalene units which are not in close contact with the DNA, 
without perturbing the binding specificity. This might 
enable us to endow the macrocycle with reporter groups 
(fluorescent, redox units) (71) for detection of the pairing 
defect. It may be expected that this pseudocatenated inter- 
action can result in a slow dissociation of the ligand-DNA 
complex, which may dramatically increase the residence 
time of the macrocycle inside the DNA, as compared 
with classical intercalators or minor groove binders 
and interfere with the processing of mismatch repair by 
enzymes. Altogether, the present work highlights that bis- 
naphthalene 2,7-BisNP and, more largely, bisintercalators 
of the CBI type hold a great potential for therapeutic and 
diagnostic applications. 
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